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SUMMARY 

The use of subband coding on image data is discussed. An overview of subband coding is 
given. Advantages of subbanding for browsing and progressive resolution are presented. Implementa- 
tions for lossless and lossy coding are discussed. Algorithm considerations and simple implementa- 
tions of subband systems are given. 


I. INTRODUCTION 

Subband coding is a promising strategy for data compression that has been used successfully in 
recent years. This paper discusses subband coding with the perspective of organizing data for storage 
and retrieval rather than as just a data compression strategy. An overview of subband coding is given 
followed by discussions of subbanding for archiving. Finally, some simple algorithms for subbanding 
image data are shown. 


Subband coding of speech was developed in the late 1970’s (ref. 1) and matured in the 1980’s. 
The key development for subbanding signals was the Quadrature Mirror Filter (QMF) in 1976 (ref. 2). 
A QMF bank cancels out the aliasing that is generated by filtering providing a good quality recon- 
struction of the signal. Subband coding was applied to image data in the mid-1980’s (refs. 3 to 5). 
The theory for perfect reconstruction filter banks (which not only cancel aliasing, but do not introduce 
amplitude or phase distortion) was also developed around this time (ref. 6). An overview of concepts 
important to subband coding can be found in references 7 to 9. 


II. SUBBAND CODING 

A generic subband coding system is shown in figure 1. The original signal is split up into N 
frequency bands (subbands) by the analysis filter bank represented by the matrix H(z . ). Each subband 
is then decimated, keeping only every Mh value. If the proper conditions are met, the resulting total 
number of values in the subbands is the same as in the original signal. No compression has been 
achieved yet; the subbanding process prepares the data for compression coding. The coders shown in 
each of the subbands could be the same or (more commonly) tailored to match the characteristics of 
the data in a particular type of subband. 

After the data has been coded, it is transmitted through the channel and received or, equivalently, 
stored and recalled. The data is decoded and the subbands sent through a synthesis filter bank, F(z), 
where the original signal is reconstructed. The synthesis filter bank has a special relationship with the 
analysis bank such that the aliasing caused by the analysis filters is canceled. The filter banks can be 


designed such that the reconstructed signal is a perfect replica of the original signal, provided that no 
errors have been introduced in the coding or transmission stages. 

To take advantage of the two-dimensional structure of image data, the filters used in the subband 
system should be two dimensional. Designing such filters is difficult, so separable filters have 
typically been used to accomplish two-dimensional processing. Separable filters use one-dimensional 
filters and apply them to the data in two directions, i.e., first horizontally and then vertically. This 
results in straightforward filter design, but requires storage to access horizontally processed data in a 
vertical direction (see fig. 2). 

An important property of image data is the importance of phase as a vehicle for information. A 
good example is obtained from reconstructions of images using only the Fourier transform phase 
compared with reconstructions using only the magnitude; the phase-only reconstructions are recogniz- 
able whereas the magnitude-only reconstructions are not (ref. 10). Linear phase filters are most 
commonly used for image data. Since most of the information is in the phase, frequency resolving 
capability is not as critical an issue with image data as it is with, say, audio data. This means that 
short kemal filters can be used on image data with good results. 

The advantages of using short kernal filters include simple hardware design and speed of 
algorithm execution. Short kernal implementations are attractive for on-line compression and for fast 
retrieval of stored data. 


III. BROWSING AND PROGRESSIVE RESOLUTION 

A property of the subband signals that makes the splitting of the original signal into lowpass, 
bandpass, and highpass bands useful for lossy compression is that most of the original signal energy 
winds up in the low frequency band. The higher frequency subbands can be compressed quite a bit 
without compromising the picture quality if the lowest subband is not degraded. 

The fact that the low frequency band is a lower resolution version of the original image makes 
subbanding attractive as a browsing tool. The low frequency band is generated as a consequence of 
subbanding and can be tracked separately from the higher frequency subbands in a browsing catalog. 
The low band can be retrieved and viewed separately. If a higher resolution version is desired, the 
higher frequency bands can be retrieved and image reconstructed. 

Color images are handled by subbanding each component separately. These components could 
be RGB, YUV, etc. For browsing, the most significant component can be used (e.g., luminance for 
color images) or all the components can be retrieved and viewed separately or recombined into a low 
resolution color image. Care must be taken when subbanding chrominance data or false colors can be 
generated in the low resolution (low frequency) band. 

Color image data can be stored as color components (RGB) or as luminance and chrominance 
data (e.g., YUV). The components are usually subbanded and. coded separately. A subband filter that 
averages pixels for the low frequency band can generate false colors around transitions. To avoid this 
it is possible to use a low frequency subband that is just a subsampled version of the original (see 
section V). 

A typical approach to generating multiple subbands is to cascade sets of two-channel filters in a 
tree structure. Although it is possible to design multiple-band filter banks, it is usually easier to just 
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cascade a simple two-band filter bank to get 4, 7, 16, or whatever number of bands is appropriate. A 
corresponding tree structure is used for the synthesis bank and this lends itself to progressive 
resolution reconstruction. 

The two most common tree structures are the octave-band split and the uniform-band split (see 
fig. 3). Either structure is equivalent from a browsing standpoint since the low frequency bands can 
be made as small as is practical for viewing using either tree. For progressive resolution, the octave- 
band split is most efficient in terms of reconstructing a series of increasing resolution images. 


IV. LOSSLESS AND LOSSY CODING 

Subbanding is a strategy for coding and as such can be used in either a lossless or lossy manner. 
The subbanding process itself can be made lossless by using perfect reconstruction filter banks. Of 
course, errors in the coding or transmission of the data will result in losses. The subbanding analysis 
and synthesis processes do not in themselves result in any data loss if properly designed. 

There are two places where losses can be intentionally introduced: (1) in the coders, and (2) in 
the way subband values are treated in terms of maintaining accuracy. Lossy coders deliberately throw 
out “data” that is irrelevant or is predominantly noise. The signal reconstructed from lossy coders 
does not (in most cases) exactly match the original signal. This “loss” is by design and in some cases 
can result in a reconstructed signal that is better than the original. The term “lossy” is unfortunate in 
that it conjures up thoughts of lost data in scientists’ minds when most of what is usually lost is noise. 

The second area where losses can be introduced is in the accuracy of subband values. A typical 
original signal sample size might be eight bits. When that signal is subbanded, it may take larger 
sample sizes in the subbands to maintain the accuracy needed for perfect reconstruction. For example, 
a simple two-channel split will require a minimum of 1/2 bit/sample larger average sample size to 
maintain accuracy. For a more conventional filter, the sample size requirements are much greater. 
Although the number of samples does not increase over the original signal due to subbanding, the total 
amount of data increases due to the required accuracy if perfect reconstruction is desired. 

An advantage of simple subbanding systems, such as the Walsh-Hadamard transform/subband 
system (WHT), is that the sample size does not have to increase very much to maintain perfect 
accuracy. For the two-dimensional, 2 by 2 WHT, the sample size only increases 2 bits to maintain 
perfect accuracy. For more complex subband filters double precision accuracy might be needed to 
avoid quantization errors (due to truncation or rounding). Section V presents a subband system which 
is even more efficient, in terms of sample size for lossless coding, than the WHT. 

For lossy coding the increase in subband value accuracy is not needed because these extra bits 
would just be dropped in the coding stage. A very effective coding scheme for image data uses 
simple coarse quantization of the high frequency bands for compression (ref. 1 1). 


V. SOME SUBBANDING ALGORITHMS 

It can be shown that simple, two-dimensional subband filters can be derived from block 
transforms (ref. 12). The filters are the basis pictures (formed by outer product of the basis vectors) of 
the corresponding transform matrix. The transform matrix (in the orthogonal case) is made up of 
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vectors formed from the coefficients of the analysis filters. For the Walsh-Hadamard transform 

1 1 


(WHT), with the transform matrix H - 


1 -1 


the two-dimensional filters are: 
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1 1 
1 1 
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1 1 

-1 -1 


*2 = 


1 -1 
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*3 = 


1 -1 

-1 1 


( 1 ) 


To subband an image with these filters, the image component data is broken up into 2 by 2 blocks and 
each block is multiplied (inner product) by each of the filters. For example, let be a 2 by 2 block 
(matrix) of luminance pixel values. The low band value, Sq, corresponding to that block is: 
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d 0 
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The other subband values are: 


5i - do + dj - d 2 - d 3 


5 2 — do ~ d| "i" d 2 - d 3 

5 3 = do - di - d 2 + d 3 


( 2 ) 


After these subband values have been calculated, a new 2 by 2 block of data is read in. If the 
subband values are kept in a block, then a traditional block transform is obtained. If the subband 
values are grouped separately (e.g., all the S Q values in one group), then a subband representation is 
obtained. The subband values should be scaled (by dividing by 4) and shifted to a positive range if it 
is desired to view them. If all of the low band (S 0 ) values are viewed as an image that is y 2 by y 2 the 
original size, then a low resolution, low pass version of the original image is obtained. The higher 
bands contain edge information. 


Since the inverse transform matrix is the same as the forward WHT matrix, the filters for 
reconstruction are also the same. To get the reconstructed data block, l) j, the following equations are 
used (including the overall transform scaling factor of % here): 

d 0 = % (Sq + + >S 2 + S 3 ) 

= M (Sq + - $2 - 5^) 

S 2 = Vi (Sq - Sj + S 2 - S 3 ) 
a 3 = Vi (S 0 - Sj - s 2 + S 3 ) 

Cascading the algorithm is equivalent to performing a high order subband/transform. For example, 
performing a 4 channel subbanding on data that has already been split into 4 channels results in a 
16 channel split that is equivalent to a 4 by 4 WHT. 
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Another subband system (ref. 12) uses the following nonsymmetric transform: 



1 -1 


0 1 
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The analysis filters are given by the basis pictures of G : 


*0 


0 0 
o 1 ; 




0 1 

0 -1 * 


B 2 


0 0 
.1 -lj ; 


*3 


1 -1 

-1 1 


( 3 ) 
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This system has been characteristic that the low band is a subsampled version of the original. This is 
useful for chrominance data where an averaging function could create false colors in the low band. 
This algorithm is also simpler, requiring fewer additions. It is not a symmetric transform, however, 
and the following filters are used for reconstruction: 
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The equations for analysis and synthesis corresponding to (4) and (5) are: 
Analysis Synthesis 
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VI. SIMULATION RESULTS 

Some examples of subbanding an image are shown in figures 4 to 10. Figure 4 shows the 
original image “Lenna” which was taken from the USC unofficial standard image set. The image is 
512 by 512 pixels (but appears stretched horizontally due to the display system used) and was con- 
verted to monochrome from a color original. 

Figure 5 shows a four band split of the original using G from (3). The low band in the upper 
left comer is a subsampled version of the original while the high band in the lower right comer has 
very little energy. Figure 6 is a close-up of the low band which makes the low resolution artifacts, 
such as staircase edges, more visible. 

The 16 band split shown in figure 7 was obtained by cascading two 4 band splits. The data 
shown in figure 5 was subbanded again using the same algorithm. A third cascade of the 4 band 
algorithm produced the 64 band split shown in figure 8. A close-up of the low band is shown in 
figure 9. The low band is a very low resolution version of the original, but it is usable as a browse 
picture and is only 64 by 64 pixels in size. 
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Figure 10 shows a perfect reconstruction of Lenna from a four band split using the inverse 
transform g from (3). 

To investigate the effect of subbanding on lossless compression of images, some subbanded 
image files were compressed with a public domain program that is based on the Lemple-Ziv coding 
method. Files subbanded into 4, 16, and 64 bands using both H and G transform matrices were 
compressed and compared to the original image file size. The original images did not compress well 
with the LZ coder. 

At first, the subbanded values were truncated to 8 bits before compression, resulting in a lossy 
compression. Since the LZ coder is designed to work on text characters and since the bits that were 
dropped are expected to contain mostly noise, this approach seemed a logical first step. 

The results are shown in table I. The more subbands an image is separated into, the better the 
compression. The quality of the reconstructed image remains very good since only a small amount of 
information is lost in truncation. The H transform out performed the G because when the values were 
truncated, more signal energy was removed in the H transform case (i.e., there is less loss in the G 
case). A comparison of the subband value size between H and G is shown in figure 1 1. 

For true lossless compression the full value for each subband must be kept. This is most easily 
done by storing each value as a 16 bit word. Since the lowest band of a G transformed image is still 
only 8 bits, that band can still be stored in bytes. Using H , all bands have the same value size 
(dynamic range) and so require an extra 2 bits per value for every subband stage. 

Since noise is not generally compressible, we would expect the lossless compression ratios to be 
worse than the truncated ratios of table I. Table II shows the lossless case for the four band sub- 
banded images. Although the H transformed data actually expands instead of contracts, improvements 
could be easily obtained by packing the data properly before compressing. For example, adding the 
extra 2 bits per value (for a total of 65 536 bytes) to the four-band truncated compressed file in table I 
yields a total file size of 263 630 for the Baboon image. The G transform does much better, probably 
due to the 8 bit low band values. 

The results of table I show that subbanding image data can improve the effectiveness of LZ 
compression. The more the data is subbanded, the better the compression ratios. For the truncated 
values used in these tests, the H transform matrix outperformed the G because more noise was 
removed in truncation. 

The results in table II show, in a preliminary way, the advantage of the G matrix in lossless 
coding. The G transform produces an average of 1 bit/pixel smaller value size than H which gives it a 
head start in compression. The values in table II show a trend, but the bits have not been packed in 
an efficient fashion. However, computer based methods tend to use byte and 16 bit word size values. 
Dense packing of bits requires more manipulation. 

The interesting case of the Baboon image subbanded with G compressing better when the higher 
bands are stored as 16 bit values than truncated bytes is probably due to random chance. G out 
performs H for lossless compression in this simple case. Future research will be into more efficient 
packing methods. 
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VII. CONCLUDING REMARKS 


Subbanding image data can be advantageous for browsing and progressive resolution in addition 
to data compression. Subband systems can be designed to provide perfect reconstruction. The use of 
short kemal filters allows fast, simple implementations for real-time use or fast retrieval. 

Two methods of subbanding image data with simple filters were presented. These methods can 
be expanded or cascaded to work on larger block sizes easily (ref. 12). Subbanding image data is a 
promising method of obtaining better performance from lossless coders. 
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TABLE I.— LOSSY COMPRESSION RESULTS USING AN LZ CODER ON 


SUBBANDED IMAGES WITH ALL VALUES TRUNCATED TO 8 BITS 



File size 

Image 

Number of 

Transform 

Original 

Compressed 

Ratio 


bands 

matrix 





Lenna 

1 



262 148 

228 196 

1.15:1 


4 

H 



166 343 

1.58:1 


4 

G 



173 011 

1.52:1 


16 

H 



121 418 

2.16:1 


16 

G 



142 832 

1.84:1 


64 

H 



88 383 

2.97:1 


64 

G 



125 100 

2.10:1 

Baboon 

1 





239 005 

1.10:1 


4 

H 



198 094 

1.32:1 


4 

G 



202 833 

1.29:1 


16 

H 



166 275 

1.58:1 


16 

G 



184 933 

1.42:1 


64 

H 



136 257 

1.92:1 


64 

G 



171 851 

1.53:1 


TABLE II.— LOSSLESS COMPRESSION RESULTS USING AN LZ CODER ON 
SUBBANDED IMAGES SAVED AS 16 BIT VALUES EXCEPT LOW 
BAND OF G TRANSFORM WHICH ARE SAVED IN BYTES 



File size 

Image 

Number of 

Transform 

Average pixel 

Original 

Compressed 

Ratio a 


bands 

matrix 

size, 








bits 




Lenna 

L 

\ 

H 

10 

524 292 

303 509 

0.86:1 




G 

9 

458 756 

254 594 

1.03:1 

Baboon 



H 

10 

524 292 

371 813 

0.71:1 




G 

9 

458 756 

196 535 

1.33:1 


n Ratio of original 262 148 byte file to compressed. 
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Figure 1 . — A generic four-channel subband coder. 


Horizontal filtering 


Vertical filtering 


Input 



Figure 2. — A separable, four-channel subband analysis bank. 
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(a) Uniform bands. 



(b) Octave bands. 

Figure 3. — Subband analysis tree structures. 




Figure 4. — Original image of girl, "Lenna". 


Figure 5. — Four band split of Lenna using the non-orthogonal 
transform of (3). 
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Figure 6. — Close-up of the low band of figure 4. 


Figure 7. — Sixteen band split of Lenna obtained by applying the 
transform of (3) to the data shown in figure 5. 



Figure 8. — Sixty-four band split of Lenna obtained by applying the Figure 9. — Close-up of the low band of figure 8. 

transform of (3) to the data shown in figure 7. 



C-91-11206 
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Figure 10. — Perfect reconstruction of Lenna following a four band 
split. 
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Figure 11. — Pixel value size by subband for G and H transforms, cascading the four-band analysis. 
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